Amendments to the Specification; 



Please replace the title beginning at page 1 , line 1 with the following replacement title: 

--GENE [ CLUSTER ] ENCODING A NONRIBOSOMAL PEPTIDE SYNTHETASE FOR 
THE PRODUCTION OF RAMOPLANIN [ B I OSYNTHES I S ]- 

Please replace the abstract paragraph with the following replacement paragraph: 

- The present invention relates to an isolated genetic sequence[s] encoding a 
nonribosomal peptide synthetase (NRPS) protein[s] which directs the biosynthesis of 
the antibiotic ramoplanin in microorganisms such as Actinoplanes so . The isolated 
[ bioGynthot i c] gene [c l ust e r] sequence serves as a substrate for bloengineering of 
antibiotic structures.-- 

Please replace the paragraph [0012] beginning at page 4, line 1 with the following 
replacement paragraph: 

~ In one embodiment preferred nucleic acids encode at least two, more preferably 
three, still more preferably four, or most preferably fjye or more ORFs selected from 
ORFS 1 to 33 (SEQ ID NOS: 2 to 34) of the ramoplanin locus. In one embodiment, 
combinations of ORFs selected from ORFs 1 through 33 (SEQ ID NOS 2 to 34) are 
provided which encode polypeptides that form at least the depsipeptide core structure of 
ramoplanin. In another embodiment combinations of ORFs selected from ORFs 1 
through 33 (SEQ ID NOS: 2 to 34) are provided which encode polypeptides that form at 
least the fatty-aid side chain of the depsipeptide core structure of ramoplanin. In 
another embodiment, combinations of ORFs selected from ORFs 1 through 33 (SEQ ID 
NOS: 2 to 34) are provided which encode polypeptides responsible for the synthesis of 
4-hydroxyphenylglycine (HPG) of ramoplanin. In another embodiment, combinations of 
ORFs selected from ORFs 1 through 33 (SEQ ID NOS: 2 to 34) are provided that 
encode polypeptides that form at least the beta-hydroxyasparagine residue. In another 
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embodiment, combinations of ORFs selected from ORFs 1 through 33 (SEQ ID NOS: 2 
to 34) are provided which are involved in the regulation of ramoplanin biosynthesis. In 
another embodiment, combinations of ORFs selected from ORFs 1 through 33 (SEQ ID 
NOS: 2 to 34) are provided which encode polypeptides that are involved in resistance 
and subcellular localization of the ramoplanin biosynthetic machinery. A single ORF or 
a combination of ORFs selected from ORFs 1 through 33 (SEQ ID NOS: 2 to 34) are 
provided to enhance production of ramoplanin by altering the expression level of an 
ORF selected from ORFs 1 through 33 (SEQ ID NOS: 2 to 34). In another embodiment, 
the expression level of an ORF selected from ORFs 1 through 33 (SEQ ID NOS: 2 to 
34) may be altered to increase the yield of a particular form of ramoplanin.-- 

Please replace the paragraph [0028] beginning at page 8, line 29 with the following 
replacement paragraph: 

-- Figure 3A is a clustal analysis of adenylation domains of ramoplanin biosynthetic 
enzymes (amino acids 471-959 of SEQ ID N0:13 (ORF 12). amino acids 518-990 of 
SEQ ID N0:14 (ORF 13 Ml), amino acids 1561-2052 of SEQ ID N0:14 (ORF 13 M2). 
amino acids 2619-3122 of SEQ ID NO: 14 (ORF 13 M3). amino acids 3698-4160 of SEQ 
ID N0:14 (ORF 13 M4). amino acids 4719-5192 of SEQ ID N0:14 (ORF 13 M5). amino 
acids 6318-6804 of SEQ ID N0:14 (ORF 13 M7). amino acids 487-993 of SEQ ID 
N0:15 (ORF 14 Ml), amino acids 1568-2041 of SEQ ID N0:15 (ORF 14 M2). amino 
acids 2603-3095 of SEQ ID N0:15 (ORF 14 M3). amino acids 3672-4135 of SEQ ID 
N0:15 (ORF 14 M4). amino acids 4699-5199 of SEQ ID N0:15 (ORF 14 M5). amino 
acids 5777-6280 of SEQ ID NO: 15 (ORF 14 M6). amino acids 6840-7343 of SEQ ID 
N0:15 (ORF 14 M7). amino acids 7926-8380 of SEQ ID N0:15 (ORF 14 M8). and 
amino acids 309-804 of SEQ ID NO: 18 (ORF 17). as defined In table 3) . Shown is the 
alignment of the amino acid sequence (single letter code) of all adenylation domains 
found in the ramoplanin locus relative to the adenylation domain of gramicidin S 
synthetase GrsA (SEQ ID NO:35) . Adenylation domains of multimodular non-ribosomal 
peptide synthetases 0RF13 and 0RF14 are labeled according to their corresponding 
module M1-M7 and M1-M8, respectively. Note that 0RF13 does not contain an 
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adenylation domain in module 6. Highly conserved core motifs A1-A10 of adenylation 
domains (Konz et al., 1999, Chem. Biol. Vol. 6, pp. R39-48) are highlighted by boxes. 
Key residues used to predict the substrate specificity of each adenylation domain are 
highlighted in black (see Figure 3B).- 

Please replace the paragraph [0030] beginning at page 9, line 24 with the following 
replacement paragraph: 

-- Figure 3C shows the similarity between ORF26 fSEQ ID NO:27) and acyl-CoA 
ligases. Shown is the clustal analysis of ORF 26 versus several acyl-Coenzyme A 
ligases from diverse species: Mb, Mycobacterium bovis (SEQ ID NO:36) : Mt, 
Mycobacterium tuberculosis (SEQ ID NO:37) : Sv, Streptomyces verticillus (SEQ ID 
NO:38) : Mx, Mvxococcus xantiius (SEQ ID NO:39) : Bs, Bacillus subtilis (SEQ ID 
NO:40) . Highlighted by boxes are the highly conserved core motifs AL1-AL8 of acyl- 
CoA ligases [ as dosor i bod by Du ot al., 2000 ].-- 

Please replace the paragraph [0032] beginning at page 10, line 15 with the following 
replacement paragraph: 

~ Figure 5 illustrates two clustal alignments. Figure 5A shows the local amino acid 
sequence homology between ORF 10 (amino acids 263 to 31 8 of SEQ ID NO: 11) and 
a key motif found in pfam 00753 (SEQ ID N0:41) involved in coordinating two zinc 
molecules in the beta-lactamase superfamily. (For information regarding the Pfam 
Families Datebase, see Bate man etal. Nucleic Acids Research, 2000, Vol. 28, No. 1 , 
263-266 ). 1 SML (SEQ ID NO:42) represents one member of this superfamily for which 
a crystal structure showing the intimate interaction between the zinc molecule and the 
highlighted residues is available (Ullah et al., J. Mol Biol., 1998 Nov 20; 284(1): 125-36). 
Figure 5B shows the local amino acid sequence homology between ORF 10 ( amino 
acids 405 to 452 of SEQ ID NO: 1 1 ) and a key motif found in pfam 00067 (SEQ ID 
NO:43) involved in coordinating an iron molecule in cytochrome P450 
monooxygenases.-- 
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Please replace the paragraph [0080] beginning at page 25, line 9 with the following 
replacement paragraph: 

-- Methods of cloning and expressing large nucleic acids such as gene clusters, 
including NRPS-encoding gene clusters, in cells including Streptomyces are well known 
to those skilled in the art (see, e.g., Stutzman-Engwall and Hutchinson (1989) Proc. Ntl. 
Acad. ScL USA, 86: 3135-3139: Motamedi and Hutchinson (1987) Proc. Natl. Acad. Sci. 
USA, 84: 4445-4449; Grimm etaL (1994) Gene, 151 : 1-10; Kao etal. (1994) Science, 
265 : 509-512; and Hopwood etal. (1987) Meth. EnzymoL, 153: 116-166). In some 
examples, nucleic acid sequences of well over 100 kb have been introduced into cells, 
including prokaryotic cells, using vector-based methods (see for example, Osoegawa et 
al., (1998) Genomics, 52: 1-8; [W^] Huang etal., (1996) Nucl. Acids, Res., 24: 4202- 
4209). - 

Please replace the paragraph [01 15] beginning at page 47, line 8 with the following 
replacement paragraph: 

~ Yet another explanation is that specialized domains within the NRPSs may have 
evolved the ability to carry out dual functions. One domain that stands out as a 
candidate for having such dual functions is the condensation domain. Normally within a 
typical NRPS module that introduces a D-amino acid into the peptide product, 
epimerization (E) domains follow the thiolation (T) domain. In terms of linear domain 
organization on NRPS enzymes condensation (C) domains and epimerization (E) 
domains can be thought of occupying equivalent positions. That is, in an NRPS with 
multiple modules that is devoid of E domains, a C domain from any given module is 
found directly adjacent to the thiolation (T) domain of the upstream module, in addition, 
C domains and E domains also share a considerable amount of sequence similarity. 
Several highly conserved core motifs are shared between C and E domains. One 
particularly important motif that is common to both C and E domains is the histidine 
motif HHXXXDG (SEP ID NO: 44) which has been shown by mutagenesis to form part 
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of the active site (Stachelhaus etal.\ Journal of Biological C/?em/sf/y 1998;273:22773- 
22781). Thus, the C domains of modules 2, 3, 4 and 6 of OFR 13 (SEQ ID NO: 14) and 
modules 2, 4 and 8 of ORF 14 (SEQ ID NO: 15) may be capable of amino acid 
epimerization as well as amide bond formation and be responsible for the 7-D-amino 
acid residues found in ramoplanin. 

Please replace the paragraph [0123] beginning at page 51 , line 3 with the following 
replacement paragraph: 

-- Figure 5 illustrates clustal alignments showing sequence homology between ORF [44] 
10 (SEQ ID NO: [42] H) and various metal ligand motifs. In each of the clustal 
alignments: (i) a line above the alignment is used to mark strongly conserved positions; 
(ii) an asterisk indicates positions which have a single, fully conserved residues; (iii) 
a colon ":" indicates that one of the following strong groups is fully conserved: S^T or A; 
N^E^Q or K; N^H^Q or K; N^D^E olQ; Q^H^R olK; M^LL olV; M^LL olF; H olY; 
and Fj_Y olW; and (iv) a period "." indicates that one of the following weaker groups is 
fully conserved: C^S or A; A^T or V; S^A olG; S^T^N or K; S^T^P or A; S^G^N or D; 
S^N^D^E^Q or K; N^D^E^Q^H or K; N^E^Q^H^R or K; F^V^L^I or M: and H^F or Y.-- 

Please replace the paragraph [0124] beginning at page 51, line 1 1 with the following 
replacement paragraph: 

ORF 10 (SEQ ID NO: 11) contains two amino acid sequence motifs that are 
frequently found in enzymes that use metal cofactors. The N-terminal region of ORF 10 
(SEQ ID NO: 11) contains a cluster of histidine residues (the His-motif) that shows 
significant local sequence homology to a conserved histidine motif found in several zinc- 
binding beta-lactamases. Figure 5A shows the local amino acid sequence homology 
between ORF 10 (SEQ ID NO: 11) and a key motif involved in coordinating two zinc 
molecules in the beta-lactamase superfamily. The alignment depicts amino acids 263 to 
318 of ORF 10 (SEQ ID NO: 1 1), amino acids 42 to 99 of a member of the beta- 
lactamase superfamily, the L1 metallo-beta-lactamase (1SML) from Stenotropliomonas 
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ma/topMa for which the crystal structure has been determined (Ullah etal., 1998, J. 
Mol. Biol.. 125-136 ). and amino acids 12 to 67 of the consensus sequence for 
pfam00753, i.e. the beta-lactamase superfamily motif (Bateman et al., 2000 . Nucleic 
Acids Research. Vol. 28. No. 1. 263-266 ). Highlighted in black are residues 
demonstrated in the LI metallo-beta-lactamase to co-ordinate zinc and their 
counterparts in the other two sequences. X-ray crystal structure analysis demonstrates 
that the histidine residues in this conserved motif are responsible for binding the zinc 
metal cofactor (Ullah etal., 1998. J. Mol. Biol.. 125-136 ). The precise alignment and 
conserved spacing of the amino acid residues in the His-motif of ORF 10 (SEQ ID NO: 
1 1) as compared to the zinc-binding beta-lactamases indicates that ORF 10 (SEQ ID 
NO: 11) is likely to bind a metal cofactor. ~ 

Please replace the paragraph [0125] beginning at page 51 , line 30 with the following 
replacement paragraph: 

" Figure 5B shows the local amino acid sequence homology between ORF 10 (SEQ ID 
NO: 11) and a key motif involved in coordinating an iron molecule in cytochrome P450 
monooxygenases. The alignment depicts amino acids 405 to 452 of ORF 10 (SEQ ID 
NO: 1 1) and amino acids 370 to 421 of the consensus sequence for pfam00067, i.e. the 
cytochrome P450 motif (Bateman et al., 200 0. Nucleic Acids Research. Vol. 28. No. 1 . 
263-266 ). The region of ORF 10 (SEQ ID NO: 1 1) in highlight is in relatively good 
agreement with the Prosite motif PS00086 (Hofmann etal., 1999 . Nucleic Acids 
Research. Vol. 27. No. 1. 215-219 ) required for binding iron: [FW]-[SGNH]-x-[GD]-x- 
[RKHPT]-x-C-[LIVMFAP]-[GAD], where x is [ any amino ac i d ] a cap and amino acids in 
brackets indicate the variability in a given position. Notably, the least variable positions 
of this motif are present in ORF10 (SEQ ID NO: 11), i.e. residues Phe-423, Gly-425, 
Cys-428, and Gly-430). The C-terminal region of ORF 10 (SEQ ID NO: 11) contains a 
cluster of amino acid residues that shows significant local sequence homology to a motif 
frequently found in cytochrome P450 monooxygenases (the Cys-motif). This motif 
includes a cysteine residue that is highly conserved in the cytochrome P450 
monooxygenases and that has been shown by X-ray crystal structure analysis to be 
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involved in binding the iron metal cofactor required for catalysis. The Cys-motif of ORF 
10 (SEQ ID NO: 1 1) is likely to contribute to the binding of a metal cofactor. The 
presence of two amino acid sequence motifs that are found in well-characterized metal- 
binding enzymes indicates that ORF 10 (SEQ ID NO: 1 1) is likely to be a metal-binding 
enzyme. Thus, the ORF 10 (SEQ ID NO: 1 1) is likely to be responsible for the 
formation of beta-hydroxyasparagine during the synthesis of ramoplanin.-- 

Please replace the paragraph [0130] beginning at page 53, line 28 with the following 
replacement paragraph: 

~ As an alternative source of native ORF 10 (SEQ ID NO: 1 1), a Streptomyces 
expression system was employed. ORF 10 (SEQ ID NO: 11) was amplified by high 
fidelity PGR using two specific oligonucleotides, namely primer sequences (5' to 3') N- 
oligo: CACACAGAATTCACCAGCGCCACTCGCGCTT (SEQ ID NO:45) . and C-oligo: 
CACACATCGATGGGCAACGCCGATCAGCCG fSEQ ID NO:46) . This primer pair 
introduces convenient restriction enzyme sites at either end of the ORF 10 gene but 
does not introduce any exogenous amino acids. The amplified genes were then 
subcloned using C/al and EcoRI restriction enzymes into a Streptomyces/E.coW 
expression shuttle vector, pECO1202. Following confirmation of the cloned sequences, 
Streptomyces lividans TK24 was transformed with this construct. Five independent 
transformants were selected for further analysis. Cultures were grown for 48 hours in a 
gyrating 30°C incubator using 25 ml erienmeyer flasks containing 5 ml of Tryptic Soy 
Broth (TSB, Difco). Total RNA was extracted from the cell pellets using the RNeasy kit 
(Qiagen). The integrity and concentration of the RNA was monitored by agarose gel 
electrophoresis. Subsequently, reverse transcription was performed using 1 ug total 
RNA primed with an antisense primer sequence located in the vector just downstream 
of the stop codon. Following reverse transcription of each sample and appropriate 
controls, 20 cycles of PGR were performed using the original ORF-specific 
oligonucleotides, N-oligo and C-oligo. According to the RT-PCR analysis, the five 
recombinant S. lividans clones express relatively high levels of ORF 10-specific mRNA 
and the size of the RT-PCR product is as expected. Figure 6 shows the RT-PCR 
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analysis of recombinant S. lividans clones expressing ramoplanin ORF 10, wherein is 
lane 1 is 1 kb DNA ladder; lane 2 is untransformed S. lividans] lane 3 is S. lividans 
transformed with empty expression vector; lanes 4-8 are five different S. lividans 
recombinant clones expressing ramoplanin orf 10; lane 9 is an S. lividans recombinant 
clone expressing an unrelated gene; lane 10 is negative control performed without RNA; 
lane 1 1 is negative control performed without RT; lane 12 is positive control for PGR 
using plasmid DNA.-- 

Please replace Table 2 on pages 35 -41 with the following replacement Table: 
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